A new method to distinguish non-voice and voice in speech recognition

نویسنده

  • LI CHANGCHUN
چکیده

we addressed the problem of remove the non-voice disturbance in speech recognition. It is always a big problem that the system will wrongly recognize our natural sound, like cough, breath, or sound of lip, nose as speech input and give “recognized ” words output, when we use a speech recognition system. As we know, such non-voice speech is unavoidable for natural speaking, and if we don’t supply effective control, the performance often drops to unacceptable level [1]. This paper puts forward a new method to detect fundamental frequency, and use it to distinguish real speech input and non-voice sound, like breath, lip, or noise by people walking by. Applying this method into our command recognition system, we get good results and make the system very robust and could be used in real life. Key-words Voice distinction Auto-relation Fundamental frequency endpoint detection

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Voice-based Age and Gender Recognition using Training Generative Sparse Model

Abstract: Gender recognition and age detection are important problems in telephone speech processing to investigate the identity of an individual using voice characteristics. In this paper a new gender and age recognition system is introduced based on generative incoherent models learned using sparse non-negative matrix factorization and atom correction post-processing method. Similar to genera...

متن کامل

A New Algorithm for Voice Activity Detection Based on Wavelet Packets (RESEARCH NOTE)

Speech constitutes much of the communicated information; most other perceived audio signals do not carry nearly as much information. Indeed, much of the non-speech signals maybe classified as ‘noise’ in human communication. The process of separating conversational speech and noise is termed voice activity detection (VAD). This paper describes a new approach to VAD which is based on the Wavelet ...

متن کامل

The Effects of Size and Type of Vocal Fold Polyp on Some Acoustic Voice Parameters

Background: Vocal abuse and misuse would result in vocal fold polyp. Certain features define the extent of vocal folds polyp effects on voice acoustic parameters. The present study aimed to define the effects of polyp size on acoustic voice parameters, and compare these parameters in hemorrhagic and non-hemorrhagic polyps.Methods: In the present retrospective study, 28 individuals with hemorrha...

متن کامل

طراحی یک روش آموزش ناموازی جدید برای تبدیل گفتار با عملکردی بهتر از آموزش موازی

Introduction: The art of voice mimicking by computers, has with the computer have been one of the most challenging topics of speech processing in recent years. The system of voice conversion has two sides. In one side, the speaker is the source that his or her voice has been changed for mimicking the target speaker’s voice (which is on the other side). Two methods of p...

متن کامل

Patient-Based Assessment of Effectiveness of Voice Therapy in Vocal Mass Lesions with Secondary Muscle Tension Dysphonia

Introduction: Use of patient-based voice assessment scales is an appropriate method that is frequently used to demonstrate effectiveness of voice therapy. This study was aimed at determining the effectiveness ofvoice therapy among patients with secondary muscle tension dysphonia (MTD) and vocal mass lesions.   Materials and Methods: The study design was prospective, with within-participant repe...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002